A Connectionist View on Document Classification
Author
Abstract
ion and Object-Oriented Programming in C++. John Wiley & Sons. New York. 1990.
[7] K. E. Gorlen. NIH Class Library Reference Manual (Revision 3.10). National Institutes of Health. Bethesda, MD. 1990.
[8] E. R. Kandel, S. A. Siegelbaum, and J. H. Schwartz. Synaptic Transmission. in: Principles of Neural Science (E. R. Kandel, J. H. Schwartz, and T. M. Jessel, Eds.). Elsevier. New York. 1991.
[9] E.-A. Karlsson, S. Sørumgård, and E. Tryggeseth. Classification of Object-Oriented Components for Reuse. Proceedings of the Conference on Technology of Object-Oriented Languages and Systems (TOOLS 7). Dortmund. Germany. 1992.
[10] T. Kohonen. Self-organized formation of topologically correct feature maps. Biological Cybernetics 43. 1982.
[11] T. Kohonen. Self-Organization and Associative Memory (3rd edition). Springer. Berlin. 1989.
[12] T. Kohonen. The Self-Organizing Map. Proceedings of the IEEE 78(9). 1990.
[13] C. W. Krueger. Software Reuse. ACM Computing Surveys 24(2). 1992.
[14] Y. S. Maarek and F. A. Smadja. Full Text Indexing Based on Lexical Relations An Application: Software Libraries. Proceedings of the 12th Int'l ACM SIGIR Conf. on Research and Development in Information Retrieval. 1989.
[15] Y. S. Maarek, D. M. Berry, and G. E. Kaiser. An Information Retrieval Approach For Automatically Constructing Software Libraries. IEEE Transactions on Software Engineering
Similar resources
A New Document Embedding Method for News Classification
Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. A large volume of news spreads on the web. A text classifier can categorize news automatically, which facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...
Document Analysis And Classification Based On Passing Window
In this paper we present a document analysis and classification system that segments and classifies the contents of Arabic document images. The system includes preprocessing, document segmentation, feature extraction and document classification. In preprocessing, a document image is enhanced by removing noise, binarizing the image, and detecting and correcting image skew. In document segmentation, an algorith...
Learning Document Image Features With SqueezeNet Convolutional Neural Network
The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered the current state-of-the-art models for this task. However, these classifiers have two major drawbacks: the huge computational power demand for...
A Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays a key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcomings in capturing the semantic concepts of text have motivated researchers to use...
A Document Weighted Approach for Gender and Age Prediction Based on Term Weight Measure
Author profiling is a text classification technique used to predict the profiles of the authors of unknown texts by analyzing their writing styles. Author profiles are characteristics of the authors such as gender, age, native language, country and educational background. The existing approaches to author profiling suffer from problems such as high dimensionality of features and fail to capture th...
A New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier
With the rapid growth in the number of documents, Text Document Classification (TDC) methods have become crucial. This paper presents a hybrid model of Invasive Weed Optimization (IWO) and a Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS), in order to reduce the large size of the feature space in TDC. TDC includes different actions such as text processing, feature extraction, form...